According to Artificial Analysis, a third-party AI testing firm, evaluating OpenAI's o1 reasoning model across seven popular benchmarks costs $2,767.05, while evaluating its non-reasoning model GPT-4o costs only $108.85, a gap of roughly 25 times. This disparity has prompted discussion about the sustainability and transparency of AI evaluation. Reasoning models, AI systems that work through problems step by step, excel in specific domains but are significantly more expensive to benchmark than traditional models. Arti...